Goto

Collaborating Authors

 Bentonville


Caught in the Act: a mechanistic approach to detecting deception

Boxo, Gerard, Socha, Ryan, Yoo, Daniel, Raval, Shivam

arXiv.org Artificial Intelligence

Sophisticated instrumentation for AI systems might have indicators that signal misalignment from human values, not unlike a "check engine" light in cars. One such indicator of misalignment is deceptiveness in generated responses. Future AI instrumentation may have the ability to detect when an LLM generates deceptive responses while reasoning about seemingly plausible but incorrect answers to factual questions. In this work, we demonstrate that linear probes on LLMs internal activations can detect deception in their responses with extremely high accuracy. Our probes reach a maximum of greater than 90% accuracy in distinguishing between deceptive and non-deceptive arguments generated by llama and qwen models ranging from 1.5B to 14B parameters, including their DeepSeek-r1 finetuned variants. We observe that probes on smaller models (1.5B) achieve chance accuracy at detecting deception, while larger models (greater than 7B) reach 70-80%, with their reasoning counterparts exceeding 90%. The layer-wise probe accuracy follows a three-stage pattern across layers: near-random (50%) in early layers, peaking in middle layers, and slightly declining in later layers. Furthermore, using an iterative null space projection approach, we find multitudes of linear directions that encode deception, ranging from 20 in Qwen 3B to nearly 100 in DeepSeek 7B and Qwen 14B models.


Graph Kolmogorov-Arnold Networks for Multi-Cancer Classification and Biomarker Identification, An Interpretable Multi-Omics Approach

Alharbi, Fadi, Budhiraja, Nishant, Vakanski, Aleksandar, Zhang, Boyu, Elbashir, Murtada K., Mohammed, Mohanad

arXiv.org Artificial Intelligence

The integration of multi-omics data presents a major challenge in precision medicine, requiring advanced computational methods for accurate disease classification and biological interpretation. This study introduces the Multi-Omics Graph Kolmogorov-Arnold Network (MOGKAN), a deep learning model that integrates messenger RNA, micro RNA sequences, and DNA methylation data with Protein-Protein Interaction (PPI) networks for accurate and interpretable cancer classification across 31 cancer types. MOGKAN employs a hybrid approach combining differential expression with DESeq2, Linear Models for Microarray (LIMMA), and Least Absolute Shrinkage and Selection Operator (LASSO) regression to reduce multi-omics data dimensionality while preserving relevant biological features. The model architecture is based on the Kolmogorov-Arnold theorem principle, using trainable univariate functions to enhance interpretability and feature analysis. MOGKAN achieves classification accuracy of 96.28 percent and demonstrates low experimental variability with a standard deviation that is reduced by 1.58 to 7.30 percents compared to Convolutional Neural Networks (CNNs) and Graph Neural Networks (GNNs). The biomarkers identified by MOGKAN have been validated as cancer-related markers through Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analysis. The proposed model presents an ability to uncover molecular oncogenesis mechanisms by detecting phosphoinositide-binding substances and regulating sphingolipid cellular processes. By integrating multi-omics data with graph-based deep learning, our proposed approach demonstrates superior predictive performance and interpretability that has the potential to enhance the translation of complex multi-omics data into clinically actionable cancer diagnostics.


Revisiting Le Cam's Equation: Exact Minimax Rates over Convex Density Classes

Shrotriya, Shamindra, Neykov, Matey

arXiv.org Machine Learning

We study the classical problem of deriving minimax rates for density estimation over convex density classes. Building on the pioneering work of Le Cam (1973), Birge (1983, 1986), Wong and Shen (1995), Yang and Barron (1999), we determine the exact (up to constants) minimax rate over any convex density class. This work thus extends these known results by demonstrating that the local metric entropy of the density class always captures the minimax optimal rates under such settings. Our bounds provide a unifying perspective across both parametric and nonparametric convex density classes, under weaker assumptions on the richness of the density class than previously considered. Our proposed `multistage sieve' MLE applies to any such convex density class. We further demonstrate that this estimator is also adaptive to the true underlying density of interest. We apply our risk bounds to rederive known minimax rates including bounded total variation, and Holder density classes. We further illustrate the utility of the result by deriving upper bounds for less studied classes, e.g., convex mixture of densities.


Why 'Autonomous' Vehicles Will Still Need a Human Minder

WSJ.com: WSJD - Technology

The delivery drivers of the future may not leave a package at your door. Instead, they'll be sitting several miles or even time zones away in a control room overseeing a fleet of delivery robots or drones. A look at how innovation and technology are transforming the way we live, work and play. Companies are plowing billions of dollars into autonomous technologies they hope will improve efficiency and solve worker shortages. But executives in these industries say true autonomy is many years away–and may never come.


Sam's Club Deploys Inventory Scanning Robots Chainwide

#artificialintelligence

BENTONVILLE, Ark. and SAN DIEGO, Ca.--Sam's Club has completed a roll out of inventory scanning towers that have been added to its existing fleet of floor scrubbing robots. The club store had started adding the inventory towers to its robots in January, and now all locations have the technology added. "Our initial goal at Sam's Club was to convert time historically spent on scrubbers to more member-focused activities. Our autonomous scrubbers have exceeded this goal. In addition to increasing the consistency and frequency of floor cleaning, intelligent scrubbers have empowered associates with critical insights," said Todd Garner, vice president, in-club product management.


COFAR: Commonsense and Factual Reasoning in Image Search

Gatti, Prajwal, Penamakuri, Abhirama Subramanyam, Teotia, Revant, Mishra, Anand, Sengupta, Shubhashis, Ramnani, Roshni

arXiv.org Artificial Intelligence

One characteristic that makes humans superior to modern artificially intelligent models is the ability to interpret images beyond what is visually apparent. Consider the following two natural language search queries - (i) "a queue of customers patiently waiting to buy ice cream" and (ii) "a queue of tourists going to see a famous Mughal architecture in India." Interpreting these queries requires one to reason with (i) Commonsense such as interpreting people as customers or tourists, actions as waiting to buy or going to see; and (ii) Fact or world knowledge associated with named visual entities, for example, whether the store in the image sells ice cream or whether the landmark in the image is a Mughal architecture located in India. Such reasoning goes beyond just visual recognition. To enable both commonsense and factual reasoning in the image search, we present a unified framework, namely Knowledge Retrieval-Augmented Multimodal Transformer (KRAMT), that treats the named visual entities in an image as a gateway to encyclopedic knowledge and leverages them along with natural language query to ground relevant knowledge. Further, KRAMT seamlessly integrates visual content and grounded knowledge to learn alignment between images and search queries. This unified framework is then used to perform image search requiring commonsense and factual reasoning. The retrieval performance of KRAMT is evaluated and compared with related approaches on a new dataset we introduce - namely COFAR. We make our code and dataset available at https://vl2g.github.io/projects/cofar


Insite AI raises $19M to help consumer brands figure out their in-store strategies

#artificialintelligence

Insite AI, a platform for consumer packaged goods companies that uses AI to provide recommendations on how to price, distribute and promote their products in physical stores, today launched out of stealth with $19 million in Series A capital from NewRoad Capital and M12, Microsoft's corporate venture arm. Co-founder Shaveer Mirpuri says that the funding will be put toward customer onboarding, building a team of industry experts to help shape product initiatives and an expanded feature set. Mirpuri and Jonathan Reid co-launched Insite with the belief there was a large addressable market for brick-and-mortar sales revenue growth management software. It's true retailers -- and by extension, brands -- face considerable challenges in this area, particularly as the economy takes a precipitous turn. According to NPD, more than 80% of U.S. consumers said in May that they'd rein in product spending within the next three to six months.


CSforALL Urges Greater Focus on AI and Data Science

#artificialintelligence

If you're not in the know, artificial intelligence and data science may sound like especially nerdy subsets of the already pocket-protector infused field of computer science. But anyone who is serious about expanding computer science education--a list that includes Fortune 500 company CEOs and policymakers on both sides of the aisle--should be thinking carefully about emphasizing AI, in which machines are trained to perform tasks that simulate some of what the human brain can do, and data science, in which students learn to record, store, and analyze data. That means making sure kids have access to well-designed resources to learn those subjects, bolstering professional development for those who teach them, exposing career counselors to information about how to help students pursue jobs in those fields, and much more. That imperative is at the heart of a list of recommendations by CSforALL, an education advocacy group presented last month at the International Society for Technology in Education's annual conference. Leigh Ann DeLyser, CSforALL's co-founder and executive director, spoke with Education Week about some big picture ideas around the push for a greater focus on AI and data science within computer science education.


Why Computer Science Classes Should Double Down on AI and Data Science

#artificialintelligence

If you're not in the know, artificial intelligence and data science may sound like especially nerdy subsets of the already pocket-protector infused field of computer science. But anyone who is serious about expanding computer science education--a list that includes Fortune 500 company CEOs and policymakers on both sides of the aisle --should be thinking carefully about emphasizing AI, in which machines are trained to perform tasks that simulate some of what the human brain can do, and data science, in which students learn to record, store, and analyze data. That means making sure kids have access to well-designed resources to learn those subjects, bolstering professional development for those who teach them, exposing career counselors to information about how to help students pursue jobs in those fields, and much more. That imperative is at the heart of a list of recommendations by CSforALL, an education advocacy group presented last month at the International Society for Technology in Education's annual conference. Leigh Ann DeLyser, CSforALL's co-founder and executive director, spoke with Education Week about some big picture ideas around the push for a greater focus on AI and data science within computer science education.


AI/ML, Data Science Jobs #hiring

#artificialintelligence

Johnson & Johnson (J&J) is an American multinational corporation founded in 1886 that develops medical devices, pharmaceuticals, and consumer packaged goods. Its common stock is a component of the Dow Jones Industrial Average and the company is ranked No. 36 on the 2021 Fortune 500 list of the largest United States corporations by total revenue.